DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index
نویسندگان
چکیده
In Smart Grid applications, as the number of deployed electric smart meters increases, massive amounts of valuable meter data is generated and collected every day. To enable reliable data collection and make business decisions fast, high throughput storage and high-performance analysis of massive meter data become crucial for grid companies. Considering the advantage of high efficiency, fault tolerance, and price-performance of Hadoop and Hive systems, they are frequently deployed as underlying platform for big data processing. However, in real business use cases, these data analysis applications typically involve multidimensional range queries (MDRQ) as well as batch reading and statistics on the meter data. While Hive is high-performance at complex data batch reading and analysis, it lacks efficient indexing techniques for MDRQ. In this paper, we propose DGFIndex, an index structure for Hive that efficiently supports MDRQ for massive meter data. DGFIndex divides the data space into cubes using the grid file technique. Unlike the existing indexes in Hive, which stores all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to smaller index size and faster query processing. Furthermore, with pre-computing user-defined aggregations of each cube, DGFIndex only needs to access the boundary region for aggregation query. Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive and the query performance with DGFIndex is 2-50 times faster than existing indexes in Hive and HadoopDB for aggregation query, 2-5 times faster than both for non-aggregation query, 2-75 times faster than scanning the whole table in different query selectivity.
منابع مشابه
Smart Grid Unit Commitment with Considerations for Pumped Storage Units Using Hybrid GA-Heuristic Optimization Algorithm
A host of technologies has been developed to achieve these aims of the smart grid. Some of these technologies include plug-in electric vehicle, demand response program, energy storage system and renewable distributed generation. However, the integration of the smart grid technologies in the power system operation studies such as economic emission unit commitment problem causes two major challen...
متن کاملThe Influence of Smart Grid on TOU Programs With Respect to Production Cost and Load Factor, A Case Study of Iran
Reaching an electricity system which is both economically efficient and environmentally friendly is motivating countries to design and execute different types of TOU demand response programs. But there are certain deficiencies which prevent these programs to effectively modify the load shape. Smart grid as a means could help the electricity system to reach the highest demand side management ...
متن کاملOptimal Power Flow in the Smart Grid Using Direct Load Control Program
This paper proposes an Optimal Power Flow (OPF) algorithm by Direct Load Control (DLC) programs to optimize the operational cost of smart grids considering various scenarios based on different constraints. The cost function includes active power production cost of available power sources and a novel flexible load curtailment cost associated with DLC programs. The load curtailment cost is based ...
متن کاملA Triple State Time Variant Cost Function Unit Commitment with Significant Vehicle to Grid Penetration
Hastening the power industry toward smart operation juxtaposed with the unrivaled restructuring and privatization agendas, some of the ubiquitous smart grid advantages are glanced more and more. Recently, the vehicle to grid (V2G) technology, as one of these beneficial aspects, has found a worldwide attention due to its important advantages. The V2G technology can raise the system operation eff...
متن کاملEnhancing Smart Grid with Session-Oriented Communication System to Truly Support Reliability and Robustness
Environmental sustainability issues and the costs of new power generation and transmission have increased the interest in evolving current power grid to new technologies. The Smart Grid is a promising technology, since it allows a distributed computing approach with potentials for self-diagnosing/-healing, reliable multi-user communication and fast hard real-time control. However, the missing s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PVLDB
دوره 7 شماره
صفحات -
تاریخ انتشار 2014